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Abstract 

We consider an unknown response function / defined on A — [0, l] d , I < d < oo, 
taken at n random uniform design points and observed with Gaussian noise of 
known variance. Given a positive sequence r n — > as n — > oo and a known 
function /q £ Z>2(A), we propose, under general conditions, a unified framework 
for the goodness-of-fit testing problem for testing the null hypothesis H Q : / = f 
against the alternative Hi : f £ J 7 , \\f — f \\ > r n , where T is an ellipsoid in 
the Hilbert space £2 (A) with respect to the tensor product Fourier basis and || • | 
is the norm in Z,2(A). We obtain both rate and sharp asymptotics for the error 
probabilities in the minimax setup. The derived tests are inherently non-adaptive. 

Several illustrative examples are presented. In particular, we consider functions 
belonging to ellipsoids arising from the well-known multidimensional Sobolev and 
tensor product Sobolev norms as well as from the less-known Sloan- Wozniakowski 
norm and a norm constructed from multivariable analytic functions on the complex 
strip. 

Some extensions of the suggested minimax goodness-of-fit testing methodology 
covering the cases of general design schemes with a known product probability 
density function, unknown variance, other basis functions and adaptivity of the 
suggested tests, are also briefly discussed. 
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1 Introduction 



We consider the multivariate nonparametric regression model with a random uniform 
design. More precisely, we observe 

Xi = f(U)+Zi, i = l,...,n, (1.1) 

where t{ are random design points, U E A = [0, l] d , 1 < d < oo. In particular, we as- 
sume that ti = {t^} are (for k = 1, . . . , d and i = 1, . . . , n) independent and identically 

distributed (iid) random variables with a uniform distribution, i.e., W(0, 1). More- 
over, we assume that, conditionally onT n = {£]_,... , t n }, £j are iid Gaussian random 

variables with mean zero and variance r 2 , i.e., *~ A/"(0, r 2 ), where r 2 is assumed to 
be known with < r 2 < oo. 

Given a positive sequence r n — > as n — ► oo and a known function /o £ I j 2 (A), 
where L2(A) is the set of squared-integrable functions on A, we propose, under general 
conditions, a unified framework for the goodness-of-fit testing problem for testing the 
null hypothesis 

H :f = fo (1.2) 

against the alternative 

Hi:feT, ||/ -/o|| >r n , (1.3) 

where F is an ellipsoid in the Hilbert space £2 (A) with respect to the tensor product 
Fourier basis and || • || is the norm in L%(A), (The set T corresponds to a "regularity 
constraint" on the response function /.) 

We are interested in both rate and sharp asymptotics for the error probabilities in 
the minimax setup, i.e., we try to find the maximal rate of convergence of r n — > as 
n — > 00 which provide nontrivial minimax testing, when certain constraints are imposed 
on the regularity of the response function /. 

Although there is a plethora of research work in the literature on the estimation 
problem for response functions / € T in (both univariate and multivariate) nonpara- 
metric regression (under various design schemes), much less attention has been paid to 
the hypotheses testing problem in this model, especially in the multivariate case. This 
work is devoted to the goodness-of-fit testing problem (jl.2p - (jl.3p in the nonparametric 
regression model (jl.lj) . 

Nonparametric goodness-of-fit testing was studied intensively during the last twenty 
years or so; however, main results were obtained for the detection of the response 
function / 6 L2(A), with d = 1, in the 1-variable Gaussian white noise model, i.e., 

dX(t) = f(t)dt + edW(t), t€[0,l], (1.4) 

where W(t) is the standard Wiener process, with the noise level e — > 0. In particular, 
rate and sharp asymptotics for the error probabilities in the minimax setup were ob- 
tained for various classes T of nonparametric alternatives. Moreover, under periodicity, 
the sharp asymptotics are of Gaussian type and are determined by a specific extremal 
problem (see, e.g., 0, 0, Q3|, Wti- 

These results have been extended in part to the density, spectral density, nonpara- 
metric regression and Poisson models for the 1-variable case (see, e.g., [8], |14| . |17| . 
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|18|). Note that, under some regularity constraints, one can formally deduce some re- 
sults for the 1-variable density and nonparametric regression models from results on the 
asymptotic equivalence (in Le Cam sense) of these models to the 1-variable Gaussian 
white noise model (see, e.g., |2j, |26j). 

For the d-variable Gaussian white noise model, we have typically similar separation 
rates with the smoothness parameter a (associated with the "regularity constraint" 
on the response function /) replaced by a = a/d as well as sharp asymptotics of a 
similar type (see |19j). This leads to the "curse of dimensionality" phenomenon when 
d is large (see [2Q]). It was recently shown that one can actually lift the curse of 
dimensionality by using different type of regularity constraints, which are determined 
by the so-called "Sloan- Wozniakowski" norm (see [2Q]). Although, analogously to the 
1-variable case, one can formally deduce, under some stronger regularity constraints, 
some results for the multivariate nonparametric regression models from results on the 
asymptotic equivalence (in Le Cam sense) of these models to the cf-variable Gaussian 
white noise model (see, e.g., |3j, [21]), one cannot apply these results to the tensor 
product Sobolev or Sloan- Wozniakowski type spaces, because there are no asymptotic 
equivalence results as yet for these spaces. 

Rate asymptotics in d- variable parametric regression models were studied in, e.g., 
[2], [H], for testing a parametric model against Lipschitz and Holder classes T of 
alternatives, respectively. On the other hand, rate asymptotics in the multivariate 
regression model, under equispaced design points, were studied in [IJ for the goodness- 
of-fit testing problem (|1.2p - (jl.3j) . under Besov balls T of alternatives. 

The purpose of this paper is to extend some results on the goodness-of-fit testing of 
0) [H], P2]-[ZI] for the d- variable Gaussian white noise model to the goodness-of-fit 
testing problem (|1.2p - (jl.3j) for the multivariate nonparametric regression model (jl.ll) . 
in a unified framework. 

In our study, we use analytic results on an extermal problem for ellipsoids that were 
presented in [T3], [H]-[2T] for the d- variable Gaussian white noise model. These lead 
to the asymptotic efficiency of testing for the multivariate nonparametric regression 
model (jl.ip . similar to the ones that have earlier been obtained, in specific settings, for 
the d- variable Gaussian white noise model, under the standard calibration e = r/y/n. 
However, the machinery of reduction of the hypothesis testing problems to the extermal 
problem is different and, essentially, more difficult, especially for the study of the lower 
bounds. The proposed tests are of different structure as well: they are based on U- 
statistics of increasing dimension. Certainly, this reduction requires some assumptions 
on the basis functions and on the sample size (compare with [6] for estimation problem). 
It is a typical situation for extending results from the Gaussian white noise model to 
other statistical models (e.g., density, spectral density, intensity of a Poisson process 
and so on). 

Several illustrative examples are presented. In particular, we consider functions be- 
longing to the balls under the well-known multidimensional Sobolev and tensor product 
Sobolev norms as well as from the less-known Sloan- Wozniakowski norm and a norm 
constructed from multivariable analytic functions on the complex strip. Some exten- 
sions of the suggested minimax goodness-of-fit testing methodology, covering the cases 
of general design schemes with a known product probability density function, unknown 
variance, other basis functions and adaptivity of the suggested tests, are also briefly 
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discussed. 



2 Preliminaries and assumptions 
2.1 Minimax goodness-of-fit testing 

Consider the multivariate nonparametric regression model (jl.ip . Given a known func- 
tion /o E 1/2 (A), we test the null hypothesis ()1.2j> . i.e., we test ifo '■ f = fo- Given a 
positive sequence r n — > as n — > oo, let 

^(r n ) = {/E^: ||/-/o|| >r n }, 

where .F is an ellipsoid in the Hilbert space .^(A) with respect to the tensor product 
Fourier basis and || • || is the norm in Z/2(A). Consider now the alternative hypothesis 
(|1.3p . i.e., consider Hi : f E J-{r n ). (In what follows, without loss of generality, we 
restrict ourselves to the cases /o = and r = 1.) 

Set X n = {x%, . . . , x n } and recall that T n = {t\, . . . , t n }. Let P n j be the probability 
measure that corresponds to Z n = (X n ,T n ) and denote by E n j the expectation over 
this probability measure. Let tp be a (randomized) test, i.e., a measurable function of 
the observation Z n taking values in [0, 1]: the null hypothesis is rejected with probability 
if)(Z n ) and is accepted with probability 1 — ifj(Z n ). Let 

a(ip) = E nfi tp 
be its type I error probability, and let 

f3(F,r n ,ip) = sup E n j(l-tp) 

be its maximal type II error probability. We consider two criteria of asymptotic opti- 
mality: 

[1] The first one corresponds to the classical Neyman-Pearson criterion. For a E 
(0, 1) we set 

f3(T,r n ,a)= inf (3(F,r n ,ip). 

ip: a(ip)<a 

We call a sequence of tests VVi,« asymptotically minimax if 

a(ip n ,a) <a + o(l), /5(JF, r n , V>n,a) = PiJ 7 , r n ,a) + o(l), 

where o(l) is a sequence tending to zero; here, and in what follows, unless otherwise 
stated, all limits are taken as n — > 00. 

[2] The second one corresponds to the total error probabilities. Let r y(J r ,r n ,'^) be 
the sum of the type I and the maximal type II error probabilities, and let j{J-, r n ) be 
the minimax total error probability, i.e., 

where the infimum is taken over all possible tests. We call a sequence of tests ip n 
asymptotically minimax if 

7(.F, r n ,tp n ) = r n ) + o(l). 
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It is known that (see, e.g., Chapter 2 of [IB]) that 

P(F,r n ,a) G [0,1 -a], 7(^,r„)= inf (a + (3{T, r„, a)) € [0, 1]. 

ae(0,l) 

We consider the problems of rate and sharp asymptotics for the error probabilities 
in the minimax setup. The rate optimality problem corresponds to the study of the 
conditions for which ^(J 7 , r n ) — * 1 and j(J-,r n ) — > and, under the conditions of the 
last relation, to the construction of asymptotically minimax consistent sequences ip n , 
i.e, such that 7 (J 7 , r n , ip n ) — > 0. Often, these conditions correspond to some minimal 
decreasing rates for the sequence r n . Namely, we say that the positive sequence r* = 
r^J 7 ), r* — ► 0, is a separation rate, if 

7(J^,r n ) ->• 1 as r n /r* n -> 0, 

and 

j (J 7 , r n ) — > 0, and f3(iF,r n ,a) — > for any a G (0, 1), as r n /r^ — > 00. 

In other words, it means that, for large n, one can detect all functions in / G J 7 if the 
ratio ?" n /r* is large, whereas if this ratio is small then it is impossible to distinguish 
between the null and the alternative hypothesis, with small minimax total error prob- 
ability. Hence, the rate optimality problem corresponds to finding the separation rates 
r* and to constructing asymptotically minimax consistent sequence of tests. 

On the other hand, the sharp optimality problem corresponds to the study of the 
asymptotics of the quantities l3{iF, r n ,a), ^{J 7 , r n ) (up to vanishing terms) and to the 
construction of asymptotically minimax sequences if) n ,ai ^Pn, respectively. Often, the 
sharp asymptotics are of Gaussian type, i.e., 

[3{T,r n ,a) = <t>(H^ - Un )+o(l), 7 (^,r„) = 2$(-« n ) + o(l), (2.1) 

where is the standard Gaussian distribution function, H^ a ' is its (1 — a)-quantile, i.e., 
$(#(«)) = l- a , and the sequence u n = u^J 7 , r n ) characterizes distinguishability in the 
problem. The separation rates r* are usually determined by the relation u n {T , r*) X 1 
(see, e.g., [14] . [T8]). Hence, the sharp optimality problem corresponds to calculating 
the sequence u n and to constructing asymptotically minimax sequence of tests. 

2.2 Assumptions 

Let L2(A) = L2, £ be a denumerable set, {4>i}i^c be an orthonormal system in L2, 
and L2 C L2 be the closed linear hull of the system {<^z}ze£. For a function / G , let 
9 = {0i}i e/ r be the "generalized" Fourier coefficients with respect to this system, i.e., 
9i = (/, (pi), I £ C, where (•, •) denotes the inner product in L2. 

Let a collection of coefficients {c/}z e £, q > 0, be given. The set of functions J 7 C L2 
under consideration are the ellipsoids with respect to the orthonormal system {4>i}i£c 
with coefficients {q}; 6j c, / G C, i.e., 

r={f ■■ f(t) = j2^i(t), £ c ^ 2 <i}. 

lec lec 
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Let 

M(C) = {I e C : q < C}, N(C) = #M(C), 
where # denotes the cardinality of a set. 

Consider the following set of assumptions: 
(Al) The set N{C) is finite, i.e., 

N(C) < oo V C > 0. 

(A2) The orthonormal system {(f>i}i£c satisfies 

£ tf(t) = N(C) VO0, t€ A. 

(A3) The functions f € J 7 are uniformly bounded in L p (A)-norm for some p > 4, 

i.e., 

3 p > 4 : sup / \f{t)\ p < oo. 

Remark 2.1 Note that assumption (A3) follows from the following stronger condi- 
tion, 

sup (I/Hoc < oo, (2.2) 

where ||/||oo = sup teA |/(*)|. 

3 Rate optimality 

In what follows, the relation A n ~ B n means that A n /B n tends to 1 while the relation 
A n x B n means that there exists constants < c\ < C2 < oo and no large enough such 
that ci < A n /B n < C2 for n > uq. Let also l^ A y be the indicator function of a set A. 

For a sequence C = C n , let M = M(C n ), N = N(C n ). 

Let us introduce an extra assumption. 

(Bl) N = o{n). 

Theorem 1 Let r n — > 0. 

(1) [Lower bounds] Assume (A1)-(A2). Take C n — > oo suc/t i/tailim sup (C n r n ) < 1 
and (Bl) holds. Then 

r n , a) > $(ijW - u„) + o(l), 7 (.F, r n ) > 2$(-« n ) + o(l), 



where 



2 4 

2 n r„ 



t4 = — f . (3.1) 

(2) [Upper bounds] Assume (A1)-(A3). Take C n — ► oo suc/i t/iai (Bl) holds. 
Consider the sequence of tests tp^ = Inj n >H\ based on the U -statistics 

U n = ~ Yl R n(^,Zk), (3.2) 
l<i<k<n 
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where = (xj,ij), i = 1, . . . ,n are the observations, with the kernel 

K n (z\z") = x'x"G n (t,t"), G n (t ,t") = J jj^2 <f>i(t )(f>i(t" ). (3.3) 

Set 

Then, uniformly over H = H n 6 M, 

a(^) <l-*(H) + o(l), 

and, for any c £ (0, 1), uniformly over f £ J 7 and H = H n such that h n (f) > a£f n; 

p{^,r n ^)<9{H-hnU)) + o{l). 

Remark 3.1 We now give some intuition about the suggested [/-statistics used in 
Theorem 1. For testing the null hypothesis Hq : / = in the Gaussian white noise 
model, a natural test statistic is a centered and normalized (under Hq) version of 
the quadratic functional Yliec@l> wnere ®l = §& < t ) i{t)dX{t). The analog of 6i in the 
multivariate nonparametric regression model (jl.ip is given by 6i = n^ 1 Y17=l fiiiti)^ 
which leads to the quadratic functional 

1 n 

^2 Of = ^Y1 XiX k G n (ti,t k ), G n (t',t) =J2Mt')<t>l(t')- 
lec i,k=i lec 

Suppressing now the terms with i = k, a centered and normalized version of this 
quadratic functional corresponds to the [/-statistic defined in (|3.2[) with the kernel 
defined in (13.31). 



Let the sequence C = C n be determined by the "balance equation" 

C*N(C n ) x n 2 . (3.5) 
Observe that, in this case, under (Al), C n — ► oo and, hence, N(C n ) — > oo. 



Remark 3.2 Note that if r n satisfies C n r n x 1, then (|3.5p corresponds to u n x 1 in 
(I3.ip . Corollaries [U and [2] below show a motivation of (13. 5p . 



Let us introduce an extra assumption. 
(B2) For any B > 0, N(C n ) x N(BC n ). 

Note that we can obtain lower bounds for h n (f) from (|3.4p . Indeed, for / £ ^(r n ), 
we have 



"»</) = iffE"?- E "0 -^( r »- c - 2 E c ? 



2 a 2 



> 



2 



(r 2 - C- 2 ) = -=§= (1 - (r n C n r 2 ) ■ (3.6) 



'2iV V2iV 
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Therefore, if C n r n > B > 1, we have from Theorem [T] (2), 

l3(F,r n ,^)<<S>(H-u n (l-B- 2 ))+o(l), 
with u n determined by f)3. 1 [) . This leads to 

Corollary 1 Let r n 0. Assume (A1)-(A3) and (B1)-(B2). Then 
[1] The separation rates are of the form 

where the sequence C = C n is determined by $3.5\) . 

[2] Moreover, let r n /r^ — > 00. Then, there exists a sequence H = H n —* 00 such 
that the sequence of tests ip^ = ^{U n >H} * s asymptotically minimax consistent, i.e., 
7(^,r n ,^)-0. 

We say that a function g(t), t > 0, is a slowly varying function if g(Bt)/g(t) tends 
to 1 as t — > 00, for any B > 0. 

This leads to the following assumption. 

(B3) N(C n ) is a slowly varying function. 

Corollary 2 Let r n -> 0. Assume (A1)-(A3) and (B1)-(B3). Then 

[1] The sharp asymptotics 12.1]) hold, where u n is defined by \3. 1\) with any N{C n ) 
determined by \3. 5\) . 

[2] Moreover, for any sequence C n satisfying h3. 5\) . there exists a sequence B n — > 00 
such that, for the sequence C n ^\ = B n C n , the sequence of tests if^n^ * s asymptoti- 

u /2 

cally minimax under the Neyman-Pearson criterion, and the sequence of tests ip n n is 
asymptotically minimax under the total error probability criterion. 

Proof. In order to get the upper bounds, note that under (B3) one can take a sequence 
B n — > 00 such that N(B n C n ) ~ N(C n ). Applying Theorem Q] (2) for the sequence 
C n: i = B n C n , and for H = and H = u n /2, and recalling (|3.6p . we obtain 

inf M/)>«n(l + °(l)). 

By Corollary 2 (2) now follows. 

In order to get the lower bounds, observe first that asymptotics of u n do not depend 
on a sequence C n involved in (|3.5[) . In fact, if C n fl is another sequence applicable to 
(j33j) . then C nfl ~ B n C n , B„xl and, under (B3), we have N(C nfi ) ~ N(C n ). Fix 
now a sequence C n in (13. 5p . It suffices to consider the x 1, which corresponds 

to having r n C n ~ A n x 1. By taking another sequence C n fl = B n C n , B n ~ (2A n ) _1 , 
we get r n C nt o ~ 1/2. Applying Theorem Q] (1), Corollary 2 (1) now follows. This 
completes the proof of Corollary 2. □ 
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4 Sharp optimality 
4.1 Extremal problem 

In order to describe the sharp asymptotics similar to [H], [18], we have to consider an 
extremal problem on the space of collections v = {v{\i^c- 

Assume that r n —* 0. For b = b n x 1, B = B n x 1, by following arguments similar 
to those in Chapter 4 of [18J, we arrive at 



o£V n (b,B) 2 Jt/ , 



V n (b,B) = L : > n(Br n ) 2 , ^cfvf < nb 2 \ . 

^ lec lec ' 



(4.2) 



Let u n (B) = u n (l,B) and u n = u n (l, 1). From Proposition 2.8 of [E], it follows that 
ii 2 (&, 1?) is a convex function in (b 2 ,B 2 ) and, from rescaling arguments, it is easily seen 
that u 2 n (b,B) = b 4 u 2 n {B/b). 

By using Lagrange multipliers, the extremal collection v n = {w/ jn }ze£ m flUD is °f 
the form vf n = Zq(1 — (q/C) 2 )+, where a+ = max(0, a) for any real number a, and the 
quantities zq = z n fi(b,B) > 0, C = C n (b,B) are determined by the equations 

= *3 EC 1 " (Q/C) 2 ) = n(i?r n ) 2 , (4.3) 
ze£ c ; <c* 

E C ^n = *8 E C '(! " ^l°) 2 ) = ( 44 ) 

ze£ c t <c 
while the value of the extremal problem is 

Let 



l&C ci<C 



h = E^-^) 2 )' ^o = E( 1 -( c '/ c ) 2 ) 2 ' 

is A/" ieA/" 
la = £(q/C) a (l - (q/C) a ). 

It is easily seen that the equations (|4.3I )— (|4.5I) can be rewritten in the form 

z 2 h = n(Br n ) 2 , C 2 z 2 I 2 =nb 2 , u 2 n (b, B) = ^J = , (4.6) 

Observe that Ii = Iq + 12 > -^2 and 

° 2 = ^ b2 ( Br nr 2 - 00 as r n ^0. 

Under (Al), this yields A — ► 00. Moreover, one has 

(3/4)A(C/2) < Ji < JV(C), (3/4) 2 A(C/2) < I < N(C). 
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Hence, under (B2), these yield 



2 2 4 

2 _ nr n 2 /, m _ n r n 



hxIo^N, 4-^, <(6,B)x-^. (4.7) 

Introduce the additional assumption 

(CI) For all B = B n x 1, it n (fi) x it n . 

Note that, under assumption (CI), we get 

n 2 (6, -B) ~ n 2 as & = o n — ► 1, J3 = -B„ — > 1. 
(compare with Propositions 2.8 and 5.6 in [T8]). 

4.2 Sharp asymptotics 

Theorem 2 Zei r n — > 0. 

(%) /Lower bounds] Assume fAlj-fA2j, (Blj-(B2j and (Cl,). T/ien 

0(F,r n ,a) >3>(tf (a) -u n ) + o(l), 7 (.F, r„) > 2$(-u n /2) + o(l), (4.8) 



where u n is the value of the extremal problem \4- i| ), ( f^.£| ) /or 6 = = 1. 

[Upper bounds] Assume (A1)-(A3) and (B1)-(B2). Let liminfu n > 0. Con- 
sider t/ie sequence of tests tp^ = ^{U n >H} based on the U -statistics 



U n = - y~] K n (zi,z k ), 



n 

l<j<A:<n 



where z\ = (xj,ij), i = 1, . . . ,n, are the observations, with the kernel 

K n (z',z")=x'x"G n (t',t"), G n (t',t") = J2 w n,lHt)Mi"), ( 4 -9) 

where w nj i = vf n /u n and {vi :n } is the extremal sequence of the extremal problem $4.1]) , 
fiJJfy for b = B = 1, or, equivalently, 

w n ,l = (1 - ( Cl /Cf) + / Wn , W \ = \ ^(1 - (q/C) 2 ) 2 . 

Then, uniformly over H = H n E IR, 

a(^)<l-*(H) + o(l), 

and, /or any c E (0, 1), uniformly over H = H n such that u n > ci7 n; 

0(F,r n ,iltf) <$(H-u n )+o(l). (4.10) 

Remark 4.1 Combining (|4.8p and (|4.10p . we see that the sequence of tests with 
/J = i?( a ) is asymptotically minimax under the Neyman- Pearson criterion, i.e., 

a(^ ia) ) < a + o(l), (3(T,r n ,^ (a) ) = $(H^-u n ) + o(l), 

and the sequence of tests ipn with H = u n /2 is asymptotically minimax under the total 
error probability criterion, i.e., 

7 (^,r n ,C" /2 ) = 2$(-n n /2) + (l). 
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5 Tensor product Fourier basis 

Let /Z£° C 2Z°° consists of all sequences I = (h, . . . , Id, . . .) with finite number j 
such that lj ^ 0, and consider the natural embedding 7L d C 2ZJ° : (h,---,ld) - > 
(li, . . . , la, 0, . . .). Let C be an infinite subset of TL^ . 

Consider the tensor product Fourier basis {4>i}iec in ^2, i-e., 

Ut)=\\^l k it k ), t = (t 1 ,...t d ,...)€A, l€C, (5.1) 

k 

where (ftj(u), j G TL, u G [0, 1], is the standard Fourier basis in -^([0, 1]), i-e., 

4>o(u) = 1, = V2cos(2irju), 0_j(u) = \/2 sin(27r/u), j > 0. 

Definition 5.1 A set C is called sign- symmetric if, for all I = . . . ,ld, . . .) £ C, one 
has el = (sih, ■ ■ ■ ,£dld, - - -) G £ for all Ej = ±1. 

Definition 5.2 The collection {/i/}/g£ is called sign- symmetric if the set C is sign- 
symmetric and hi = h £ i for all I G C and e = (ej, . . . , Ed, . . .), £j = ±1. 

(Dl) The set C and the collection of coefficients {q}/ £ £ are sign-symmetric. 

Let us now show that, under assumptions (Al) and (Dl), assumption (A2) holds 
true for the tensor product Fourier basis (|5.ip . Since the set M is sign-symmetric then, 
under assumption (Dl), this follows from the following statement. 

Lemma 5.1 Let M C Hf^ be a finite sign-symmetric set and let {4>i}ieC be the tensor 
product Fourier basis ( 15. Then 

Y^4n{t) = #{M) we A. 

Proof. Consider the presentation M. = L) U M U , where u C IN and Ai u consists of 
I G M such that #{j : lj ^ 0} = m. It suffices to check that, for all u, 

tf(t) = #(M u ) VtG A. 

Clearly, this holds for u = 0. Without loss of generality, assume m = {1, . . . , d}, dGlN. 
Let = {I G M. u '■ lj > V j G u}. Since A4 is sign-symmetric, consists of all 
el, I G M+, e = (ei,... 3 e d ), e k = ±1 and #(M U ) = 2 d #(M+). It suffices then to 
check that, for each I G M+, 

£4(0 = 2 d . 

e 

Consider e k , k = l,...,d, as iid Rademacher random variables, i.e., P{e k = 1) = 
P{e k = —1) = 1/2. Then, by independency, 

d d 

£ k=l k=l 

since E ek 4)l kh {t k ) = (2sm 2 (l k t k ) + 2 cos 2 {l k t k ))/2 = 1. This completes the proof of 
Lemma 15.11 □ 
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Remark 5.1 Note that for the tensor product Fourier basis (|5.1|) . condition (|2.2|) (and, 
hence, assumption (A3)) is fulfilled if 

2 J( V 2 < °°, J(l) = #{j ■ h + 0}. (5.2) 
Indeed, we have sup tg ^ |</>z(i)| = 2 J ®' 2 , and hence 




lec 



6 Examples: rate and sharp asymptotics in various ellip- 
soids 

Let us first give some extra notation. For the function / = YlieC^l^ ^ we se ^ 
ll/llc = Y,iec tftf and let L 2,c ~ if e L 2 '■ \\f\\c < °°} be the Hilbert space with the 
norm || • || c . (Clearly the ellipsoid T is the unit ball in L^.) 

Consider the tensor product Fourier basis (|5,ip . In all examples below, assumption 
(Dl) holds true. Hence, by Lemma 15-H assumption (A2) holds true. It is easily seen 
that assumption (Al) is also fulfilled in all examples below. That the assumption (A3) 
holds also true is discussed in each example separately. 

The first two examples are versions of the classical multidimensional Sobolev norm 
(see pi]). 



6.1 Multidimensional Sobolev norms 

Let A = [0, l] d , d e IN, C = 7L d \ {0}, and let 

d 



cf = J2 |2vr/ fe | 2<T , I € £, a > 0. (6.1) 



k=l 



Then, for a £ IN, the norm ||/|| c corresponds to the sum of cr-derivatives of a 1-periodic 
/ over all variables, i.e., 



d 

2 

k=l 



Ell^/A^H 2 , (6.2) 



where || • || is the norm in ^(A). 

Assumption (A3) is fulfilled for a > d/A by the so-called Sobolev embedding theo- 
rem (see Eq. (3.2.20) of 0). 

Let now 



\^{2irl k f^ , leC, a>0. 



cf = I \ \2-Kl k f ) , I £ C, a > 0. (6.3) 
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Then, for a G IN, the norm ||/|| c corresponds to the sum of all the derivatives of a 
1-periodic / of order a, i.e., 



d 

2 



£...£l|ff7/ftii---«y a - (M 



ii=l V = l 



Certainly, the norms (|6.2p and (|6.4p are equivalent for any fixed d since the ratio of 
coefficients in (16. ip and (|6.3h is bounded and away from 0. Hence, assumption (A3) is 
fulfilled for a > d/4. 

It was shown in [19J that 

N(C)~C d/<T J k {d,<j), k = 1,2, 

(e.g., k = 1 corresponds to (6.1) and (6.2), and k = 2 corresponds to (6.3) and (6.4)), 
where 

r rf (i + 1/2(7) i 

Jl(d,<T) - 7^(1 + d/2*)' J2(d ' a) = 2^T(lT^)' 
Using equation (|3.5p . these yield 

C x n 2<7 /( 4<7+d > , jV(C) x n 2 ^ 4 ^) . 

Hence, assumption (B2) is fulfilled while assumption (Bl) is fulfilled for a > d/4. 
Thus, we obtain the separation rates 

r * = n -2 CT /(4a+d)_ 

For the sharp asymptotics, it was shown that 

u 2 n ~C k (d,a)n 2 r* +d /°, k = 1,2, 

where, for the norm (|6,2p . 

TT d (l + 2<r/d)r(l + d/2a) 



Ci(d,a) 



(1 + Aa/d) 1 + d l 2a T d {l + l/2a) ' 
and for the norm (|6.4p . 

vr rf (l + 2a/(i)r(l+(i/2) 
21 ' J (1 + 4a/d) 1 + d / 2 -r rf (3/2) • 

Assumption (CI) is thus fulfilled. Hence, we arrive at (|2.ip . 

The next two examples correspond to tensor product norms in ANOVA modeling. 
These spaces are capable of dealing with interactions of all orders in a flexible way, 
thus vastly extending the classical additive methodology in multivariate nonparametric 
regression inference (see 
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6.2 Tensor product Sobolev norm 

Let A = [0, l] d , d £ IN, C = 7L d , and let 

ci = J] l 2 ^'!"' I e A co r ..,o = 1. (6.5) 

For a a £ IN, this corresponds to the following (see [25]). Let us consider the functional 
orthogonal ANOVA expansion 



/(*) 



y)/«(*u), / fu(tu)dt k = Q Vk£u, (6.6) 

•/A 



where the sum is taken over all subsets u = {ji, . . . j m } C {1, . . . , d}, 1 < ji < ... < 
jm — d} and t u = {tj 1 , . . . , tj m }, if u = 0, then /„ = constant = J\ f(t)dt. Then, 



H-^'llc.u' 



where ||/ u || c ,u is the norm of mixed mcr-derivatives of a 1-periodic f u , i.e., 

\\fu\\c,u = \\d m ' T f/dt? 1 ...dt°J. (6.7) 

Assumption (A3) is fulfilled for a > 1/4, using appropriate embedding properties (see 
Chapter III of (30]). 

It was shown in [21] that 

nw TrV^rCd) ' { 1 

Using equation (13. 5ft . this yields 

2 \ ff/(4<x+l) 

C ' 



log d_1 (n 

Hence, assumption (B2) is fulfilled while assumption (Bl) is fulfilled for a > 1/4. 
Thus, we obtain the separation rates 

For the sharp asymptotics, it was shown that 

2 C(d, a)n 2 rt +1/cT , , 

log 01 x (m ) 

where 



, 26(o- r(d) vro- d , , . 2a + 1 . . 

C(d o-) = ; ' v ;v / , b(a) = . 6.10 

v ; (l + 4o- fc ( CT ) w 2<r v ; 



Assumption (CI) is thus fulfilled. Hence, we arrive at (|2.1 
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6.3 ANOVA subspaces 



Let A = [0, l] d , d £ IN. Taking m £ {0, 1, ... , d}, let £^ be the set that consists of 
I £ K d such that #{k : l k ^ 0} = m, and £ d < m = ©!?l £j. Under (ESD, the spaces 

L 2 m and ' consist of the functions 

/(*)= E /(*)= E 

respectively, i.e., they consist of sums of functions of m variables or no more than m 
variables. If m = 0, this corresponds to the constant function while the case m = 1 
corresponds to functions with an additive structure. Take cj according to (|6.5p . Then, 
we obtain, 

imi c 2 = E n/«iii«. n/ii c 2 = E n/«ii£«> 

u:#(u)=m u:#(«)<m 

respectively, where, for cr £ IN, the norm [|/ u [| C) u of a 1-periodic / u is determined by 
(j6.7|) (see [SS]). Assumption (A3) is fulfilled for a > 1/4, since the spaces presented 
here are subspaces of the tensor product Sobolev spaces discussed in Section 6.3. 

Take q according to (16. 5ft . Denote by N<i{C) the function N(C) for the tensor 
product Sobolev norms, by N^ m {C) the function A r (C) for £ = £ d,m , and by N d n (C) 
the function N(C) for £ = Observe that 



N%(C), N djm (C)=J2(JjNf(C) 



Set M = ( ) and note that M > 1 for < m < d. It was shown in [2T] that, as 
C — > oo, 

AT diTO (C) ~ MiV-(C) ~ MiV m (C) ~ m J^L \ , (6.H) 

ix m o m i l (m) 

the last relation follows from ()6.8|) . For both the cases £^ and £ d,m , using (|3.5p . we 
have 

/ ~2 \ <r/(4ff+l) 

Cx ] , h = n/VM. 

Hence, assumption (B2) is fulfilled while assumption (Bl) is fulfilled for a > 1/4. 
Thus, we obtain the separation rates 

log m - 1 (n) ^ /(4 ' T+1) 
n 2 

Let u n ,d be the quantities that determine the sharp asymptotics for the tensor product 
Sobolev norms with sharp asymptotics (|6.9[) . Using (16. lip , we obtain, for both cases, 
the sharp asymptotics 

2 <^ C(m,<r)n 2 r* +1/g 

n M Mlog™- 1 ^ 1 ) ' [b ' 

where the constant C(m,a) is defined by (|6.1(jp . (Note that (|6.12p corresponds, in the 
case m < d, to some loss of efficiency compared to (|6.9|) . since the sample size n is now 
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reduced by the factor M 1 ' 2 > 1.) Assumption (CI) is thus fulfilled. Hence, we arrive 
at (EH)- 



The next example corresponds to classical multivariable analytic functions on the 
complex strip (see [23], 



6.4 Multivariable analytic functions on the complex strip 

Let A = [0, l] d , del, C = 7L d and, for k > 0, let 

d 

cf = Y\ COSh(27TK/fc), I € C. 
k=l 

This corresponds to analytic functions / that provide periodic extensions to the complex 
c?-dimensional strip (t\ + iu\, . . . ,td + iua), |tt&| < k (i.e., of size 2k), and 

\2 n -d II tl i „ ...Ml 2 



This case is closely related to the case 

( d \ 
cf = exp 2ttk \h\ ) I £ £ 

V fc=i / 

(see [24]). Using e'^' /2 < cosh(:r) < & x \ condition (|2.2p is fulfilled for any k > (by 
Remark 5.1), since 

/ oo \ d 

^ 2 J(/) C/ ~ 2 < 2 d ^ q 2 I 1 + 2 exp(2vTKA;) J < oo. 
lec lec V fe=i / 

Thus, assumption (A3) is fulfilled. 
It was shown in |21| that 



{ ' (TTK) d r(d+V 

Using equation (|3.5|) . this yields 

r „ ^ 2 

(log(n)) d / 4 ' 

Hence, assumptions (Bl), (B2) are fulfilled; moreover N{C) is a slowly varying func- 
tion, i.e., assumption (B3) is also fulfilled. Thus, we get the separation rates 

. (log(n)) d / 4 

and the sharp asymptotics 



u 2 ~ 



(7TK) d r(d + l)n 2 r 4 



21og d (n) 

Assumption (CI) is thus fulfilled. Hence, we arrive at (|2.ip . 

The last example corresponds to an infinitely dimensional extension of the ANOVA 
decomposition, that was first suggested to lift the curse of dimensionality in high- 
dimensional numerical integration (see [23], [28], [32J). 
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6.5 Sloan- Wozniakowski norm 

Let A = [0, 1]°°, C = Taking a > 0, s > 0, let 

q= Y[ i S | 27r ^T> / G £, s > 0, cr > 0, c ,...,o,... = 1- 

This corresponds to an infinite tensor product of weighed Hilbert spaces. Under an 
infinite-dimensional ANOVA expansion, 



u, 



/(*)=£/«(*«), / fu(t u )dt k = Vk£ 
where the sum is taken over all finite subsets u C IN, we obtain 

and, for a G IN, the norm ||/ u ||c U of a 1-periodic /„ is determined by (|6.7p (see [20] and 
compare with [23], [28], [32]). 

Contrary to the previous examples, we are not aware of any embedding theorems 
for spaces of the Sloan- Wozniakowski type, and hence we cannot verify Assumption 
(A3) under minimal smoothness conditions (like a* = min(<7, s) > 1/4). However, 
condition (|2.2p . which leads to the Assumption (A3), is fulfilled for a* > 1/2. Indeed, 
let {xkj), k G 7L, 1 < j < d, be a matrix. Applying the formula 

d d 

e n \, = n e x *,h i={h,...,i d )e n\ 

l<=LTL d 3=^ j=lle2Z 

to the matrix entries 

fl, fc = 0, 

Xk >J ]2j- 2s \2Ttk\- 2 ° , k^0, 

and letting d — > oo, we get, for a > 1/2 and s > 1/2, 



E 2J( V 2 = E II *r*\**i 



-2a 

3\ 



= \l + 2j- 2s ^\2-Kk\- 2(7 <oo; K = 2Z\{0}. 

Thus, by Remark 5.1, assumption (A3) is fulfilled for a* > 1/2. 

For simplicity, we consider below only the case a ^ s. It was shown in [20] that if 
< a < s then 

N(C) ~ A 1 C 1/a exp(A 2 (logC) a/{a+s) )(logC)- A \ 
and that if < s < a then 

N(C) ~ 5 1 C 1 / s exp(S 2 (logC) 1 / 2 )(logC)- B3 , 
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where Ai, i = 1, 2, and Bi, i = 1,2, 3, are positive constants which only depend on cr, s. 

bllowin 

log(C) 



Recall that a* = min(s, a). Then, we get the following log-asymptotics 



log(iV(C)) 



which correspond to the Sobolev norms for d = 1 and a = a* . 

It also follows that assumption (B2) is fulfilled while assumption (Bl) is fulfilled 
for a* > 1/4. The separation rates are of the following form. If < a < s, then 

r* x n -2-/(4<x+i) exp (c 1 (log(n)) <7 ^ s+a ^ (log(n))~ C2 , 

and if < s < a, then 

r* x n~ 2s /( 4s+1 ) exp (D iy /l^n)\ {log(n))" D \ 

These yield the following log-asymptotics 

2a* log(n) 

MO ~ -^r+r ■ 

The sharp asymptotics are of the following form. If < a < s, then 

u 2 n ~ C3n 2 r 4+1 /<x exp (-^(logr- 1 )-/^)) (logr" 1 )^. 
If < s < cr, then 

n 2 ~ L> 3 n 2 r 4+1 / S exp (-^Vlogr^ 1 ^ (logr- 1 ) 3 / 4 , 

where Cj, i = 1, . . . , 5, and Di, i = 1, . . . ,4, are positive constants which only depend 
on cr, s. Thus, assumption (CI) is fulfilled. Hence, we arrive at (|2.1|) . 



7 Some General Remarks 

In this section, we discuss how the main results, established in Theorems [T] and [5] 
(and, hence, Corollaries 1 and 2) can be extended to more general settings, involving 
non-uniform design schemes and unknown variances. Some remarks about adaptivity 
issues are also presented. We also present other, than the Fourier basis and its tensor 
product version, examples of basis functions that satisfy assumption (A2), and reveal 
how assumption (A2) can be replaced by a weaker assumption at the cost of replacing 
assumption (Bl) with a slightly stronger assumption. 



7.1 General random design schemes 

The main results, established in Theorems Q] and [21 are evidently extended to ran- 
dom design points y = (y 1 , . . . ,y d ) G IR d , d > 1, with a known product probability 
density function, p(y) = p\{y l ) x ... x Pd(y d ), by applying the coordinates Smirnov 
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transform, i.e., y — ► F(y) = (Fi(y 1 ), . . . , F<i{y d )) G A = [0, l] d , where is the cu- 
mulative distribution function corresponding to the probability density function p^. 
Indeed, consider the goodness-of-fit testing problem for testing the null hypothesis 
Ho ■ f = against the alternative Hi : f G Tp : ||/||2,p > r n , where Tp consists 
of functions defined on IR d and which have the form g(y) = f(F(y)), y G TR d , with 
g G T and ||/|| 2 ,p = {f R d f 2 (y)p(y)d{y)) 1/2 ; note that, in this case, ||/|| 2 ,p = Ibll- 
The corresponding test statistics are now based on the kernels (|3.3j) and (14. 9p with 
t = (t 1 , . . . ,t d ) replaced by F(y) = (F\(y 1 ), . . . ,Fd(y d )) (compare with [15]). 

We conjecture that the main results, established in Theorems [T] and [21 can be 
also extended, subject to some additional constraints similar to [15], to unknown 
product probability density functions by replacing F(y) = (Fi(y 1 ), . . . , Fd{y d )) with 
F n {y) = (F n< i(y 1 ), . . . , F n ^(y d )) in the appropriate test statistics, where F n ^ is the 
empirical distribution function corresponding to F^ for the design points yf,...,y*; 
this development is, however, outside the scope of this paper. 

7.2 Unknown variance 

The results obtained in Theorems Q] and [2] are evidently true when £j *~ AA(0, 1) is re- 
placed by £j *~ M(0, t 2 ), where r 2 is a known variance with < r 2 < oo, by multiplying 
u n by the factor t~ 2 and multiplying r* by the factor r, for the lower bounds, and by 
multiplying the kernels (13. 3p and (14. 9p by the factor r~ 2 , for the upper bounds. 

For an unknown variance r 2 with < f3\ < r 2 < /?2 < oo, we replace the multiplica- 
tive factor t~ 2 appeared in the kernels f)3.3() and (|4.9j) by r~ 2 , where r 2 = X^2=i x f • It 
is easily seen that 

F nJ rl = r 2 + Il/H 2 , Var nJ T; 2 = - ||/|| 4 + 4r 2 ||/|| 2 + 2r 4 ) = o(l), 

the latter being true from assumption (A3). These yield r 2 ~ (r 2 + ||/|| 2 ), in P n ,f- 
probability, which makes possible to repeat all the arguments presented in Appendix 
2 (observe that, in Appendix 2, ||/|| 2 = o(l) for "least favorable" alternative functions 

/en 

The above observations indicated that the main results established in Theorems [T] 
and [2] still remain true when the variance r 2 is either known or, when unknown, is 
replaced by an appropriate estimator as the one considered above. 

7.3 Adaptivity 

Typically, the smoothness parameter (cr for Sobolev norms, k for analytic function, 
min(cj, s) for Sloan- Wozniakowski norms) is unknown. This leads to the so-called prob- 
lem of adaptivity: one has to construct a test procedure that provides the best minimax 
efficiency (separation rates or sharp asymptotics) for a wide range of values of the un- 
known smoothness parameter. This problem was first studied in |29|, and further 
developed in Chapter 7 of [18], for the 1-variable Gaussian white noise model. The 
idea is to use the Bonferroni procedure, i.e., to combine a collection of tests for a suit- 
able grid in a region of the unknown smoothness parameter. It was shown in [18] and 
[29] that this procedure provides an asymptotically minimax adaptive testing with a 
small loss (one gets an additional (but unavoidable) log log (e^ 1 ) factor in the separation 
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rates). We conjecture that these ideas of adaptivity could be also developed for the 
multivariate nonparametric regression models considered in this paper but the exact 
details should be carefully addressed; this development is, however, outside the scope 
of this paper. 

7.4 Other examples of basis functions satisfying Assumption (A2) 

(a) (Haar basis): Let 4>jk(t), j = 0,1,..., k = 1,...,2 J , t £ [0,1], be the standard 
Haar orthonormal system on [0, 1] (see, e.g., Chapter 7 of [31]), where j is the scale 
parameter and k is the shift parameter. Note that, in this case, J2 k <^(i) = 2 J , for each 
resolution j. Consider now the tensor product version of the Haar basis on A = [0, l] d , 
d > 1, and consider coefficients q = Cj, I = k\), . . . , (jd, kd)), that only depend on 
the scale parameter j = (ji, ■ ■ ■ ,jd) and not on the shift parameter k = (A?i, . . . , kd)- 
Hence, by working along the lines of Section 5, it follows that the tensor product Haar 
basis functions on A satisfy Assumption (A2). 

(b) ( Walsh basis): Let 4>j(t), j = 0, 1, . . ., t & [0, 1], be the Walsh basis functions system 
on [0,1]; the Walsh basis functions take actually sums and differences of the Haar basis 
functions to obtain a complete orthonormal system (see, e.g., Chapter 7 of [3T]). Note 
that, in this case, |^>j(x)| = 1, for each j. Consider now the tensor product version of 
the Walsh basis functions on A = [0, l} d , d > 1. Hence, it follows immediately that the 
tensor product Walsh basis functions on A satisfy Assumption (A2). 

(c) (Orthonomal basis on a compact connected Riemannian manifold without boundary): 
Let S be a compact connected Riemannian manifold without boundary and consider 
the orthonormal system of eigenfunctions cj>jk(x), x £ S, associated with the Laplacian 
(Laplace-Beltrami operator) on S, for different eigenvalues \j, X\ < A2 < ... with 
Xj — > 00 as j — > 00 (see, e.g., [3]). For each j = 1,2,..., they satisfy the relation 

X)fcLi(^j fc( x ) — Z 2 " 1 ^)) = 0, where kj < 00 is the (algebraic) multiplicity of the 
eigenvalue Xj and [i is the invariant measure on S (see, e.g., formula (3.18), p. 127 of 
[6], or the last line of p. 1256 of [1]). The above relation is a natural and deep extension 
of the classical relation sin 2 (x) + cos 2 (x) = 1 for the 1-dimensional circle. Similar to (a), 
consider now coefficients cy fc) = Cj or corresponding coefficients q = Cj for the tensor 
product basis functions on S d , d > 1. Hence, by working along the lines of Section 
5, it follows that the tensor product basis functions on S d satisfy Assumption (A2). 
Therefore, our general framework could be a platform to derive analogous statements to 
the ones given in Theorems [Hand [2] for minimax goodness-of-fit testing in nonparametric 
regression problems on compact connected Riemannian manifolds without boundary, 
S, or their products, S d , but the details in the derivation of these statements should 
be carefully addressed; this development is, however, outside the scope of this paper. 

7.5 Replacing assumption (A2) by a weaker assumption 
Assumption (A2) can be replaced by the weaker assumption 

(A2a) sup Yl <Pht) = 0{N(C)) as C -> 00, 
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(it covers the cosines orthonormal system, compactly supported (other than the Haar 
basis) orthonormal wavelet systems, as well as their tensor product versions) by replac- 
ing assumption (Bl) with the slightly stronger assumption 

(Bla) N = o(n 2/3 ). 

Indeed, the only difference in the proofs of Theorems [T] and [2] is in the relation (|8.9p . 
In particular, one can use the Cauchy-Schwarz inequality which yields an additional 
factor N, and this is compensated by assumption (Bla). 

8 Appendix 1: proof of lower bounds 

Let us start with some extra notation. Recall first that X n = {x±, . . . ,x n }, T n = 
{t\, . . . ,t n }, Z n = (X n ,T n ), and Z{ = (xi,ti), and that P n j is the probability measure 
that corresponds to Z n whereas E n j is the expectation over this probability mea- 
sure. Denote also by Var n j the corresponding variance. Let P n ^ be the probabil- 
ity measure that corresponds to T n and P£ * be the conditional probability measure 
with respect to T n . We denote by E n ^ and E^ j the expectations over these prob- 
ability measures, whereas Var n) T, Var^ ^ are the corresponding variances. (Clearly, 
Enj(-) = E n>T Elj{-).) Also, for the function / = Y,A$h 

we denote the measure P n j 

by P n .e, with analogous notation for the expectations, conditional expectations and 
variances. Let also En and Var^ be the expectation and variance of the conditional 
probability measure with respect to 3 n = . . . , £ n }, where £j*~AA(0, 1). Certainly, 

Pn£ = Pnfl- 

8.1 Lower bounds for Theorem [2] 
8.1.1 Priors 

We use the constructions similar to [7j and follow, but with necessary modifications, 
techniques from |14j-|18j. It suffices to consider the case 

u 2 n x 1. (8.1) 

Take 5 G (0, 1), let a^ n = v^ n (b, B) be the extremal collection for the extremal problem 
(fi~Tj) . (f42|) with 6=1 — 6,B = 1 + 6, and let A = A n be the diagonal matrix with 
diagonal elements ai = ai n , I G AT. 

Under (pUl) . using (CI), (STTJ), we have 

u 2 n (b,B) = lY, a tn ~1, D n = Nm a xal n ~z*N>,l. (8.2) 

Let v = ^JnO and let 7r n (dv) be the Gaussian prior N(0,A 2 ) on the parametric space 
consisting of {vi}i & c = \/^{$z}ze£> i- e -> v i are independent in / and, for each I, vi ~ 
A^(0, of) for q < C and V[ = for q > C, in 7r n -probability. 

Note that, in the sequence space of the "generalized" Fourier coefficients = {#/}; g £ 
with respect to the orthonormal system {4>i}ieC, the null hypothesis (jl.2j) (recall that 
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/o = 0) corresponds to H : 6 = and, assuming f £ J 7 , the alternative hypothesis 
(jl.3p corresponds to 

Hi: £<M<1, J> 2 >ri (8.3) 

Let V n = V n (l, 1) be the set determined by (|4.2p with i? = 6 = 1; this corresponds 
to the alternative set (18.31). 



Lemma 8.1 For any 5 G (0, 1), one has Tr n (V n ) = 1 + o(l). 

Proof of Lemma 18.11 It follows from evaluations of 7r n -expectations and variances 
of the random variables TLi(v) = Yli^M v f an d ^2 = X)zeA/" c f v i ' an( ^ by using the 
Chebyshev inequality (compare with similar evaluations in [14] . |17j . |18j). □ 

Let /3(P n fi, Ptt„, ol) be the minimal type II error probability for a given level a £ 
(0, 1) and 7 (P n ,o> Pn n ) be the minimal total error probability for testing the simple null 
hypothesis Hq : P = P n fi against the simple Bayesian alternative Hq : P = P nn for the 
mixture P nn (A) = J P n n -i/ 2v (A) TT n (dv). By Lemma ED and using Proposition 2.11 in 
[TBI, we have 



(3(F,r n ,a) > (3(P nfi , P Wn ,a) + o(l), r„) > 7(^,0, P-.J + o(l). 

Hence, it suffices to show that 

P(P n ,o, P* n ,a) > - u n ) + o(l), 7 (P n , , P Wn ) > 2*(-Un/2) + o(l). (8.4) 

In order to obtain (18. 4|) . it suffices to verify that, in P^^-probability, 

\0g(dP 7r JdP nfi ) = -U 2 n /2 + U n Cn + Vn, Vn^O, ( n C ~ Af(0, 1) (8.5) 

(see Section 4.3.1, formula (4.72)). 

8.1.2 Likelihood ratio and correlation matrix 

For f(t) = Y^ieN '^l^ltf) j * ne likelihood ratio is of the form 

= T^f = ex P ( ~ \ v ' Rv + (w,v)s\ = {0i}ieAf, v = ^/n6, 
dPnfi dP^Q V 2 / 

where w = {wi}i e x, wi = w^ n = -j= Y^i=\ x i^l(U), and R is the correlation matrix 

1 " 

R = R n = {rji} j: i e j^, tji = -^2^j(ti)<f>i(tj); 

n i=i 

here, and in Section 9.1.3, (•, -) s denotes the inner product in the sequence space. 
Let Tr(-) be the trace of a square matrix. 
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Lemma 8.2 (1) The matrix R is symmetric and positively semi- defined. Moreover, 
E u ,tR = In, where In = {^jzljjZeA/ - ^ s ^e unit N x N matrix. 
(2) Under [23$ and (Bl), one has 

E n , T Ti(R 2 ) ~ N, (8.6) 
E n , T Tr((R - I N ) 2 ) = o(N), (8.7) 
E ntT Tr(R 4 ) ~ N. (8.8) 

Proof of Lemma 18.21 First, we prove statement (1). For any x = {xj}j^j\f, Xj £ IR, 
one has 



j 

Since {4>l}l£jV is an orthonormal system, 



i n ( V 

i=l \jeJV J 



E n ,Trji = / (j>j(t)(j>i(t)dt = 5ji. 
J A 

Thus, statement (1) follows. 

Now, we prove statement (2). Analogously, we have, using (A2), (Bl), 

E n ,T{rji - 5ji) 2 = Var n>T r j7 = ~ tf{t)(f> 2 (t)dt - 5 2 )j 

= - I <f> 2 j (t)cf> 2 (t)dt-~5 jl 

and 

E n:T Tr((R - I N ) 2 ) = E nA^i ~ < \ [ E 

which yields f[H7f]) . We obtain from ([121) since Tr(i? 2 ) = Tr((R - I N ) 2 ) + Tr(I N ). 
Let us now evaluate £' rij T , Tr(i? 4 ). Let R 2 — \Pji\ j,l^N > 

1 n 



n 

seAf seA/" «,/3=i 



We have 



Tr(^ 4 ) = £ bl 



1 " 

— E E 4 , j{ta)4>s{t a )4>s{tf3)<pl(t l 3)(f)j{t~ ( )<p r (t~ / )cf) r (ts)(l)l(ts 



Tl 
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Observe that 

n 

^2 E n,T {</>j(*a)</>a(ta)0s(^)^(^)</>j(*7)^r(*7)^r(*<5)^(*<5)} 

a,/3,-y,S=l 

:= S4 + S3 + S2 + S1, 
where S4-S1 correspond to the sums (we omit indexes j,l,r,s in notation of S1S4) 
5 4 = 24 Yl 

l<a</3<y<8<n 

* - « ( E + E + E V 

Yl< a =/3<7<5<n l<a</3=7<5<n l<a</3<y=8<n J 

* = 2 f E + E + E V 

yi< a =/3=7<5<n l<a</3=7=<5<n l<a=f3<y=S<n J 

s, = £ . 

l<a=/3=7=<5<n 

By independence of tj, and since {0;} is an orthonormal system, we have 
^4 = C4(n)<5 ? - S1 5 s z(5 :; >5 r /, 

S3 = C 3 (n){5 jr 5 lr <f>j{t)<l? 8 {t)<l>i{t)di + tfjAi J <f> s (t)<f>i(t)<f>j(t)<f>r(t)dt 

S 2 = C 2 (n){d rl fiWfiWhWrWdt + 5 sj $(*)$(t)^(t)&(t)dt 

+ ^(t)^(t)M*)*) QT ^(u)$(«)^( u )d u 

Si = nj^(t)4>l(t)tf.(t)tf(t)dt, 
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where C^(n) ~ n 4 , C^(n) x n 3 , C2(n) x n 2 . Therefore, 



1 ^ C 4 (n) ^ NC 4 (n) 

l,j,s,retf j,seAf JA 

3C 3 (n) /• /^..Y, 3iV 2 C 3 (n) 



/ a (j>?w)* 



n 4 



0(iV 2 /n) 



l,j,s,reAf l,j,s,r£Af J ^ 



Analogously, 



E 5 rJ / <t>%t)<t>l{t)Ut)<t>r{t)dt= / ^ $(t)$(t)$(t) \ dt 

Lis**)'- 1 * 



and 



E ( / a ^(t)^(t)0 { (*)*) ( / a ^{u^Hu^i^du 

l,j,s,reM 



A / VA 



iy2 E ( / MM®*) ( [ M u )Mu)du) =iV 2 E <?l = ^ 3 



Thus, 

^ E ^ 2 = o(^ 3 /- 2 ). 

Combining evaluations above and (Bl) we get (|8.8p : 

Tr(i? 4 ) ~ AT(1 + 0(iV/n + (iV/n) 2 + (iV/n) 3 )) - iV. 
Thus, statement (2) follows. This competes the proof of Lemma l8,2i □ 
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8.1.3 Bayesian likelihood ratio 

Let us now study the Bayesian likelihood ratio. Direct calculation gives 
dP ffB „ dPl e 



dP. 



nfi 



dPl e 1 (I , x \ 

E *"dPT = v^m exp {2 qG V 



(8.10) 



where q = Aw, G = G n = Jjv + A'RA. Let f/ > 0, / G JV, be the eigenvalues of the 
symmetric positively semi-defined matrix D = A' RA = {ajairji}j t i^. Let e\ be the 
eigenvectors of the matrix D and let qi = {q,ei) s , I G C. 
We can now rewrite (|8.10p in the form 

Let ||^4||oo = su P||a;|l<i ll-^ll f° r a generic matrix A. Observe that 

\\Dt 00 =mzxft <Y,ft = MD A ). 

Using the standard relations 

Tr (AC) = Tv(CA) and Tv(A'BA) < ||A||^Tr(B), 
for a symmetric positively semi-defined matrix B, we get the inequalities 
Tr(D 2 ) < ||A||^Tr(i? 2 ) and Tr(L> 4 ) < p||^Tr(i? 4 ). 

By Q, 

\\Af 00 = m^at<D n /N. 
Jointly with (|8.6p and (|8.8|) . the above yields 

-En,T (Tr(D 2 )) = 0(1), E n , T (Tr(D 4 )) = O^ 1 ). 

Hence, 

E n . T (maxlnl) =0(N~ 1 / 4 ). 



Thus, in P n ^-probability, 



I^Hoo =max|f/| =o(l). (8.11) 

leAf 



Using the well-known relations 
(1 + y)" 1 = l-y + o(y) and log(l + y) - y + y 2 /2 = o(y 2 ), as y -> 0, 
we get, with P nj y-probability tending to 1, by (18. lip . 



^ (Tr(Q) - Tr(L>) - Tr(QD) + Tr(D 2 )/2) + o (Tr(QD)) + o (Tr(L> 2 )) 
1 / Tr((3) - Tr(Q-D) - Tr(D 2 )/2) + o ( Tr (QD)) + o (Tr(D 2 )) , (8.12) 
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where 

Q = qq = Azz'A = {ajdiZjZ^jj^fj, Q = Q — D = A{zz — R)A. 
Let us now study the P nj o-distribution of L n . 
Lemma 8.3 In P n fi-probability, 

Tr(QD) = o(l), (8.13) 

Tr(D 2 ) = Tr(A 4 )+o(l), (8.14) 

EnflTv(Q) = 0, (8.15) 

Var n>0 Tr(Q) = 2Tr(A 4 ) + o(l). (8.16) 

Proof of Lemma 18.31 Let <3? = ^~ 1//2 {</>j(ti)}jeA/',i=i,...,n be an N x n-matrix, and set 
£' = . . . , £ n ). Then, in P nj0 -probability, 

R = z = $£, z 'z = £(££') = Jjv. 

Observe that 

< i0 zz' = $ (^ CO = **' = R, 

which yields 

< i0 (Tr(Q)) = 0, El (Tr(QD))=0. (8.17) 
Analogously, using the formula 

Var(Tr( J B^ / ))) = 2Tr{BB'), 

we get 

Vaj£ i0 (Tr(QD)) = Var^^A^'^'AD) = 2Tr{BB'), 
where B = &A 2 <I><S>'A 2 <S>. By Lemma I5T21 and (|8.2p . it is easily seen that 

Tr(BB') = Tr((Ai?A) 4 ) < ||A||^Tr(fl 4 ). 

Using the formula 

Var n , (.) = Var r « i0 (-)) + £ T (V<o(0), 

we get 

Var n , (Tr(QL>)) = o(l), 

which together with ([537]) . yields (18331) . 
To obtain ([533]) . note that 

Tr(D 2 ) = Tr(Z> 2 ) + 2Tr(A 2 Z>) + Tr(A 4 ), Z) = D - A 2 = A(R - I N )A, 

and observe that, by Lemma 18.21 and (|8.2|) . 

Tr(L> 2 ) < HAH^Tr^ - /tv) 2 ) = o(l), (Tr(A 2 D)) 2 < Tr(A 4 )Tr(L> 2 ) = o(l). 

Obviously, (I8T51) follows from (I5TT71) . and (1535]) follows from ([531]) . since 

Var^ (Tr(Q)) = Var£ >0 (Tr(A$££VA)) = 2Tr((A$$'A) 2 ) = 2Tr(I} 2 ). 

This completes the proof of Lemma 18.31 □ 

Let Cn = Tr(Q)/2u n , u 2 n = Tr(A 4 )/2. By LemmaES we rewrite ([535]) in the form 

L n = u n ( n - u 2 J2 + r] n , rj n 0. 
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Lemma 8.4 In P n ^-probability, Q n — > £ ~ M(0, 1). 

Proof of Lemma 18.41 Let us rewrite Tr(Q) in the form 

11 1 n 

-Tr(Q) = -Tr(A*(tf-I)*'A) = -J2w«{g-l) + Yl 

i=l l<i<k<n 

■ = A n + B n , 

where 



W = {w ik }l k=l = &A 2 <S>, Wlk = -Y^ afMWMtk)- 
It is easily seen that En^A n = 0, and by (A2), (1Q) . 

i=i n i=l VsA/" / 



Thus, A n — > in L2(P n ,o) and in P^o-probability. 
The item i? n is degenerate [/-statistic 

B n = - V] W n (ri,rj), ri = (£i,ti) are i.i.d, 

l<i<fc<n 

IV„(rV)=£YX a ^W'), / W n (r',r")P(dr')=0 Vr", 

where P(dr) = A/o,i(<iO x U/\(dt), i.e., £ and i are independent, £ ~ AA(0, 1) and t is 
uniformly distributed on A. 

The statement of Lemma 18.41 follows from the following proposition. 

Proposition 1 In P n ^-probability, the statistics B n are asymptotically M(0,u^). 

Proof of Proposition [TJ Clearly, Ep n0 B n = and, for r\ = r 2 = (£2^2); 

Vm-,.,„l£„) / / W^n,r 2 )P(dn)P(dr 2 ) 

2 

" v 2 



n( 


'n — 


1) 




2n 2 




n[ 


'n — 


1) 




In 2 




n{ 


[n — 


1) 




2n 2 




n[ 


'n — 


1) 


2n 2 



V a 2 af [ [ 0j(ti)^,-(*2)0i (ti)0i(ta)«tti«tt2 



E4 2 

ieA/" 



28 



For n = r 2 = (£ 2) i 2 ), ^3 = &,*3), let 

G n (n,r 2 ) = J W n (n,r 3 )W n (r 2 ,r 3 )P(dr 3 ), 

G n , 2 = f f 8n(r u r2)P(dn)P(dr2), 

W nA = [ [ W^(r 1 ,r 2 )P(dr 1 )P(dr 2 ). 



Using the asymptotic normality of degenerate [/-statistics established in [10J, together 
with Lemma 3.4 in [16], it suffices to verify the conditions 

G n , 2 = o(l), (8.18) 
W nA = o{n 2 ). (8.19) 

We have 

G n (ri,r 2 ) = Ep {d ^ 4t3) £i£ 2 £l Y a l<h(ti)<h(ta) Y °i#f Ck^-fas 

\ leAf jeAf 



= £i& Y af4<M*i)<M*2) / Mh)^(t3)dt 3 = ^ 2 Y4Mh)4>i(t2) 

G n ,2 = Efo&f I I fy)of^(ti)0i(*2)l dhdt 2 = Va^OfF 1 ), 
• /A Ja VzeA" / ZeAf 

which yields (jSTTBj) . Next, 

W ni4 = E &&) 4 J A J A (l>z<M*i)<M*2)l cttxttta 



vZeAf 

2 



< 9 sup Va^(ti)^(t 2 ) / / Va^(^,(t 2 ) dt 1 dt 2 = 0{N), 
h,t 2 eA V, c ., I JaJa \, ckr ) 



since by (A2) and (|8.2p . we have 



sup 

*1,*2GA 



X)of0,(ti)0i(^ 
ZeA/" 



sup V a^f(ti) < max a? sup V $(t x ) = 0{N 1 ' 2 ) 



This implies (|8.19p . This completes the proof of Proposition [TJ Hence, Lemma [8 
follows. □ 

Thus, we obtain (|8,5p which yields (|8.4p . Hence, Theorem [2] (1) follows. □ 
8.2 Lower bounds for Theorem [T] 

The same scheme used in the proof of the lower bounds of Theorem 2 can be also 
employed here. 
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Let C 2 r 2 n < (1 - 5), 5 > 0. It suffices to assume u\ = n 2 r n /2N = O(l). We 
take the Gaussian prior ir n = M(0, A 2 ) that corresponds to the matrix A = a n I^ with 

= nr 2 (l + 5)/N. Recall TL\, TL2 from the proof of Lemma 18. 11 Analogously to the 
proof of Lemma 18.11 we have 



Since, by Chebyshev's inequality, Vax^Hk = o^E^TCk) 2 ), k = 1,2, these yields 



Observe that the relations (|8.2p hold true with zq = a n . Repeating the calculations 
in the proof of the lower bounds of Theorem [21 we arrive at dS3]) with u 2 n = Na n /2 = 
n 2 r n /2N(l + 5) 2 . Since 5 > can be taken arbitrary small, this yields Theorem[T] (1). 
□ 

9 Appendix 2: proof of upper bounds 
9.1 Upper bounds for Theorem [2] 

We consider the test sequence ip^ = l{u n >H} based on the [/-statistics U n with the 
kernel K n (zi, Z2) of the form (|4.9p . 

9.1.1 Type I error 

Observe that K n (zi, Z2) = u~ 1 W n (zi, Z2), where W n is the kernel of the [/-statistics 
mentioned in Proposition [TJ Applying Proposition [H we get 



Var^Hi 
Var 7rn 'H2 



= a 2 n N = nr 2 (l + 5), 

< C 2 a 2 n N <nC 2 rl(l-5) < n, 
= 2a*N = 0(l), 

< 2C A a 4 n N = 0(n 2 /N). 



7Tn(K) = 1 + 0(1). 



u, 



n 




C~AA(0,1). 



This yields 



£ n , (VC) = Pnfl(Un <~H) = 1- $(H) + O(l). 



(9.1) 



9.1.2 Minimax type II error 



By (19. lp we have to verify that 



sup E n j(l-ip n ) 



sup P nJ (U n >H) = <f>(H - u n ) + o(l). 

/e^(r„) 



(9.2) 



For / = ^2 i&c 6^1, let 
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Lemma 9.1 Uniformly over f £ T , 

EnjU n ~ /!„(/), (9.3) 

Var ni/ [/ n = l + 0(||/|| 2 + ||/||4). (9.4) 

Moreover, uniformly over f £ T such that 

11/11 =o(l), ||/|U = o(l) and h n (f) = 0(l), (9.5) 

the statistics U n — h n (f) are asymptotically A/(0, 1), under P n j -probability. 

Remark 9.1 Using Holder's inequality and (A3) with p = 4 + 25, 5 > 0, we get 

||/|ll<||/IH/fc a = 2/(1 + 1/5), b = p/{\ + 5); \\f\\ <\\f\\ v . 

Therefore, under (A3), Lemma 19.11 yields 

supVar nJ [/ n = 0(l) and Var nJ [/ n = 1 + 0(||/|| 2 + ||/|| a ) (9.6) 
ft? 

uniformly over f £ T , and 

U n = h n (f) + ( n , C„^C~AA(0,1), 
uniformly over f £ T such that h n (f) = 0(1) and ||/|| = o(l). 

Proof of Lemma 19.11 Let the function / = n -1 / 2 X^e/Z^'^' Denote z = (x,t) with 
x = f{i) +£, £ and t are independent, ^ ~ A/(0, 1) and t is uniformly distributed on A. 
Since the items of the sum in [/-statistics are identically distributed and uncorrelated, 
we have 

n — 1 

E n ,fU n = ^ — E n jK n (zi,Z2), 

where z\ and Z2 are independent and distributed as z, 

?T 



E n jK n { Zl ,z 2 ) = E nJ x 1 x 2 G n {t l ,t2) = E 1 n f{t l )f{t2)G n (t 1 ,t2) 
= Y, w n,lEl(f(t)Ut)f=n- 1 ^ 



Hence, (|9.3p follows. 

Let us now evaluate the variance. Rewrite the [/-statistics in the form 

U n = [/„, + U n>1 + U n , 2 , (9.7) 

where ^ 

U n ,k = / . K n uyZi,Zj) 
n ' 

l<i<j<n 

are [/-statistics with the kernels K n ^{z\, Z2) of the form 

K n , = te2G n (t U t 2 ), K n:1 = (^f(t 2 )+^2f(t 1 ))G n (t 1 ,t2), 
K n ,2 = f(h)f(t 2 )G n {h,t2), G n (t 1 ,t 2 ) =J2 W n,M t l)M t 2), 

leAf 
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and the items U n> o, U n> i and U n ^ 2 are uncorrelated. Obviously, 

EnjUnfl = E n jU n: i = 0, 

EnjU n ,2 = "^-Y^WnJ [ f(t)<f>l(t)<lt) ~ >*»(/)■ 

Similarly to Proposition [IJ 



w n,l 



Var nJ U nfl ~ - / / G 2 n {ti } t 2 )dt 1 dt 2 = -Y] 
2 JaJa 2 f77r 

Analogously, by (A2) and (|4.7|) . and since max; = 0(1/ N) 
Var n)/ C/ n)1 ~ 2 / / / 2 (ti)G^(ti,t 2 )dtidt 2 



A JA 



2 / (/ 2 (t)^<^(t))dt = 0(||/|| 2 ). 



Next, 



Var ni/ i7 ni2 < / / r(h)f A (h)Gi(t 1 ,t 2 )dt 1 dt 2 =A n . 
J A J A 



Let G n be the integral operator in L 2 (A) associated with the symmetric positively 
semi-defined kernel G n (ti,t 2 ), t\,t 2 E A, and 

||G n ||oo = sup ||G n /|| = maxw n ,i = 0(iV~ 1/2 ). 

Il/ll<i leM 

Observe that, by (A2) and (|477f> . 

G; = sup ^ uv#(t) < JVUGnlloo, G'UG^Iloo = 0(1). 



* eA *eAf 



We have 



A. = / / Hh)4>i(t2)f 2 (h)f 2 (t 2 )G n (t u t 2 )dh 

= Y, W n,l(f<t>l, G„(/ 2 ^)) < IIGnlU Y, ™n,l\\f 

= ||G n |U I Y, w n,l<t>Kt)f\t)dt 
leAf 

< ||G n ||oo SUp ( Y W nt l(f) 2 (t) ] / / 4 (t)di 



= ||GJooG*||/||l = 0(||/||l) 
Hence, follows. 
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Using (|9.7|) . and an evaluation similar to the above under (|9.5|) . we have 

U n - h n {f) = U nfi + U n>1 + U n>2 - h n (f), 

where U n) \ — > 0, U n ^ — h n (f) — > 0, in P n j-probability. By Proposition [TJ the statistics 
U n .o are asymptotically Gaussian M(0, 1). This completes the proof of Lemma |9~T1 □ 

Let h n (f) = 0(1). Let us now evaluate ||/|| 2 , / G T. We have 

\\f\\ 2 = Y.V?--= A 'n + K, K= E °l B 'n= E 9 l 

leC ci<C/2 C(>C/2 

The second sum is controlled by 

B' n <AC- 2 ^cf0f<AC- 2 = o(l). 

The first sum is controlled by 

A' n < (4/3) ^(1 " (ct/CfW? = (4/3)K/n) E ^n,iv 2 n 
leAf leJV 
= (A/3)(w n /n)h n (f) = o(h n (f)), 

since, by (|4.7p and (Bl), we have w n /n = 0(N 1 ^ 2 jn) = o(l). Therefore, by (|9.6p . we 
have in _P n j-probability, 

tfn = M/) + Cn, Cn^C~AA(0,l), 

uniformly as h n (f) = 0(1). 
Lemma 9.2 

inf h n (f) = u n . 

Proof of Lemma 19.21 It follows using general convexity arguments (see [14] , Lemma 
11 of [17J, Proposition 4.1 of [18J). □ 

Let us now evaluate type II errors for a sequence / = f n € J-"(r n ). First, let 
h n (fn) — > oo. Applying Lemmas 19.11 19.21 and (|9.6p . we have 

E nJ (l - Vf) = P nJ (Un <H) = P n j(E nJ -U n > E nJ - H) 
< Vax n j(U n )/(E nJ - Hf = o(l). 

Let h n (f n ) = 0(1) (by Lemma [9.21 this is only possible for u n = 0(1)). Applying 
Lemmas 19.11 19.21 and (19. 6p once again, we have 

E nJ (l-^) = P n j(U n <H) = P n j(E n j-U n >E n j-H) 

= Pn,f(Cn > K(f) ~H + 0(1)) = $(# - h n (f)) + o(l). 

Therefore, 

sup E n j(l-^) = $(H- inf h n (f)) + o(l) = $>(H -u n ) + o(l). 

This yields (19.21) . Hence, Theorem [2] (2) follows. □ 
This completes the proof of Theorem [2j 
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9.2 Upper bounds for Theorem Q] 

Observe that the kernel (|3.3p is of the form (|4.9p with coefficients 

Wl,n = w n = y/2/N, I G AA. 

Hence, Proposition [1] is applicable to the [/-statistics U n with kernel (13, 3D and yields 
asymptotic normality AA(0, 1) of U n under P n fi. Thus, we get (|9.ip . Analogously, we 
obtain Lemma 19 . 1 1 with 

If h n (f) = 0(1), f £ J- , then ll/H = In fact, 

Il/H 2 = £«?<£ ^ + C ~ 2 £ ^ + C ~ 2 = «" 

leC leAf ci>c 

These yield (JOJ) for f E J- such that /i n (/) = 0(1). If h n (f) -> co, then it follows from 
Chebyshev's inequality and the boundness of the variances that P n j(U n > H) — > for 
# < c/i n (/), c e (0, 1). Hence, Theorem □ (2) follows. □ 

This completes the proof of Theorem [TJ 
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